Created: 2026-03-06 07:53:04
Updated: 2026-03-06 07:53:04
最重要的连续符号信道是高斯信道。设输入随机变量为X i X_{i} X i ,输出
Y i = X i + Z i , Z i ∼ N ( 0 , N ) Y_{i} = X_{i}+Z_{i},\qquad Z_{i} \sim \mathscr{N}(0,N) Y i = X i + Z i , Z i ∼ N ( 0 , N )
Z i Z_{i} Z i 是信号X i X_{i} X i 的噪声。 如果噪声方差为0,那么所有实数X都可被精确传输到对端;如果噪声方差非零,我们可以选取一组有限集合作为输入,从而使得输出端的分辨具有任意小的错误率,信道从而有有限的容量。如果噪声方差为0,那么信道容量为无穷。 最常见的输入限制是功率限制。假设平均功率限制为
1 n ∑ i = 1 n x i 2 ≤ P \frac{1}{n} \sum_{i=1}^n x_{i}^2 \leq P n 1 i = 1 ∑ n x i 2 ≤ P
假设希望每次发送1bit,以P \sqrt{ P } P 和− P -\sqrt{ P } − P 来发送。接收方接收Y后试图找出哪个是发送的符号。显然最优的方式是看Y的正负性。此时错误率为:
P e = 1 2 Pr ( Y < 0 ∣ X = + P ) + 1 2 Pr ( Y > 0 ∣ X = − P ) = 1 2 Pr ( Z < − P ∣ X = + P ) + 1 2 Pr ( Z > P ∣ X = − P ) = Pr ( Z > P ) = 1 − Φ ( P N ) \begin{align}
P_{e} & = \frac{1}{2} \text{Pr}(Y<0 | X=+\sqrt{ P }) + \frac{1}{2} \text{Pr} (Y>0| X=-\sqrt{ P }) \\
& = \frac{1}{2} \text{Pr}(Z< -\sqrt{ P } | X=+\sqrt{ P}) + \frac{1}{2} \text{Pr} (Z> \sqrt{ P } | X=- \sqrt{ P }) \\
& = \text{Pr}(Z>\sqrt{ P }) \\
& = 1-\Phi \left( \sqrt{ \frac{P}{N} } \right)
\end{align} P e = 2 1 Pr ( Y < 0∣ X = + P ) + 2 1 Pr ( Y > 0∣ X = − P ) = 2 1 Pr ( Z < − P ∣ X = + P ) + 2 1 Pr ( Z > P ∣ X = − P ) = Pr ( Z > P ) = 1 − Φ ( N P )
where
Φ ( x ) = ∫ − ∞ x 1 2 π e − t 2 / 2 d t \Phi(x) = \int _{-\infty }^x \frac{1}{\sqrt{2\pi }} e^{-t^2/2} \, dt Φ ( x ) = ∫ − ∞ x 2 π 1 e − t 2 /2 d t
类似的,我们还可以把幅度分为4份/8份等等,分的份数越多每次传输信息越多,但是误码率也越大。
具有功率限制P的高斯信道的信息容量(Information Capacity)定义为:
C = max p ( x ) : E X 2 ≤ P I ( X ; Y ) C= \max _{p(x) : EX^2 \leq P} I(X;Y) C = p ( x ) : E X 2 ≤ P max I ( X ; Y )
I ( X ; Y ) = h ( Y ) − h ( Y ∣ X ) = h ( Y ) − h ( X + Z ∣ X ) = h ( Y ) − h ( Z ∣ X ) = h ( Y ) − h ( Z ) \begin{align}
I(X;Y) & = h(Y) - h(Y|X) \\
& = h(Y) - h(X+Z|X) \\
& = h(Y)-h(Z|X) \\
& = h(Y)-h(Z)
\end{align} I ( X ; Y ) = h ( Y ) − h ( Y ∣ X ) = h ( Y ) − h ( X + Z ∣ X ) = h ( Y ) − h ( Z ∣ X ) = h ( Y ) − h ( Z )
由于Z和X独立,h ( Z ) = 1 2 log 2 π e N h(Z) = \frac{1}{2}\log 2\pi eN h ( Z ) = 2 1 log 2 π e N
E Y 2 = E ( X + Z ) 2 = P + N EY^2 = E(X+Z)^2 = P+N E Y 2 = E ( X + Z ) 2 = P + N
由之前结论,Y Y Y 的熵被限制在1 2 log 2 π e ( P + N ) \frac{1}{2} \log 2\pi e(P+N) 2 1 log 2 π e ( P + N ) 之内。 从而
I ( X ; Y ) = h ( Y ) − h ( Z ) ≤ 1 2 log 2 π e ( P + N ) − 1 2 log 2 π e N = 1 2 log ( 1 + P N ) \begin{align}
I(X;Y) & = h(Y)-h(Z) \\
& \leq \frac{1}{2}\log 2\pi e(P+N) - \frac{1}{2}\log 2\pi eN \\
& = \frac{1}{2}\log \left( 1+\frac{P}{N} \right)
\end{align} I ( X ; Y ) = h ( Y ) − h ( Z ) ≤ 2 1 log 2 π e ( P + N ) − 2 1 log 2 π e N = 2 1 log ( 1 + N P )
C = max E X 2 ≤ P I ( X ; Y ) = 1 2 log ( 1 + P N ) C = \max _{EX^2\leq P} I(X;Y) = \frac{1}{2} \log \left( 1+\frac{P}{N} \right) C = E X 2 ≤ P max I ( X ; Y ) = 2 1 log ( 1 + N P )
取得最大值条件为X ∼ N ( 0 , P ) X\sim \mathscr{N}(0,P) X ∼ N ( 0 , P )
这个capacity还是信道可达到的最大传输速率。
直观描述:发送码长为n时功率限制为:
x 1 2 + x 2 2 + ⋯ + x n 2 = n P x_{1}^2+x_{2}^2 +\dots +x_{n}^2 = nP x 1 2 + x 2 2 + ⋯ + x n 2 = n P
于是所有点落在n P \sqrt{ nP } n P 的n维球面内,设球体体积公式V n ( r ) = A n r n V_{n}(r) = A_{n}r^n V n ( r ) = A n r n 此时噪声的作用相当于将球内的点变到附近半径n N \sqrt{ nN } n N 范围内。 为了实现准确传输,大球所能容纳的小球个数
num = A n ( n ( P + N ) ) n A n ( n N ) n = 2 1 / 2 log ( 1 + P / N ) \text{num} = \frac{A_{n}(\sqrt{ n(P+N) })^n}{A_{n} (\sqrt{ nN })^n} = 2^{1/2 \log (1+P/N)} num = A n ( n N ) n A n ( n ( P + N ) ) n = 2 1/2 l o g ( 1 + P / N )
Band limited channels
A common model for communication over a radio network or a telephone line is a band-limited channel with white noise.这是一种连续时间信道。输入和输出之间的关系为
Y ( t ) = ( X ( t ) + Z ( t ) ) ∗ h ( t ) Y(t) = (X(t)+Z(t)) *h(t) Y ( t ) = ( X ( t ) + Z ( t )) ∗ h ( t )
X ( t ) X(t) X ( t ) 是信号波形,Z ( t ) Z(t) Z ( t ) 是高斯白噪声波形,h ( t ) h(t) h ( t ) 为理想带通滤波器的阶跃响应,它将所有高于频率W W W 的频率全部去除。我们依据简单论证给出这种信道的容量。
Theorem 10.3.1 设信号f ( t ) f(t) f ( t ) 带宽被限制在W内,即f ( t ) f(t) f ( t ) 的频谱在大于W的部分为0.那么f ( t ) f(t) f ( t ) 函数完全由每隔 1 2 W \frac{1}{2W} 2 W 1 秒对该函数的采样点决定。
f ( t ) = 1 2 π ∫ − ∞ ∞ f ( ω ) e i ω t d ω = 1 2 π ∫ − 2 π W 2 π W F ( ω ) e i ω t d ω f ( n 2 W ) = 1 2 π ∫ − 2 π W 2 π W F ( ω ) e i ω n / 2 W d ω \begin{align}
f(t) & = \frac{1}{2\pi} \int _{-\infty}^\infty f(\omega) e^{i\omega t} d\omega \\
& = \frac{1}{2\pi} \int _{-2\pi W}^{2\pi W} F(\omega)e^{i\omega t} \, d\omega \\ \\
f\left( \frac{n}{2W} \right) & = \frac{1}{2\pi} \int _{-2\pi W}^{2\pi W} F(\omega) e^{i\omega n/2W} \, d\omega
\end{align} f ( t ) f ( 2 W n ) = 2 π 1 ∫ − ∞ ∞ f ( ω ) e iω t d ω = 2 π 1 ∫ − 2 πW 2 πW F ( ω ) e iω t d ω = 2 π 1 ∫ − 2 πW 2 πW F ( ω ) e iωn /2 W d ω
左侧即为F ( ω ) F(\omega) F ( ω ) 的傅里叶展开系数(假设对F ( ω ) F(\omega) F ( ω ) 进行了周期延拓,是个周期函数,− 2 π W ≤ ω ≤ 2 π W -2\pi W\leq\omega\leq 2\pi W − 2 πW ≤ ω ≤ 2 πW ) 即
F ( ω ) = ∑ n = − ∞ ∞ f ( n 2 W ) e − i n 2 W ω − 2 π W ≤ ω ≤ 2 π W F(\omega) = \sum_{n=-\infty}^\infty f\left( \frac{n}{2W}\right) e^{-i \frac n {2W} \omega} \qquad -2\pi W\leq\omega\leq 2\pi W F ( ω ) = n = − ∞ ∑ ∞ f ( 2 W n ) e − i 2 W n ω − 2 πW ≤ ω ≤ 2 πW
F ( ω ) F(\omega) F ( ω ) 唯一确定,从而函数f ( x ) f(x) f ( x ) 是唯一确定的。
考虑函数sinc ( t ) = sin ( 2 π W t ) 2 π W t \text{sinc}(t) = \frac{\sin(2\pi Wt)}{2\pi Wt} sinc ( t ) = 2 πW t s i n ( 2 πW t ) ,这个函数在t=0时是1,t = n 2 W t=\frac{n}{2W} t = 2 W n 为0,谱只分布在( − W , W ) (-W,W) ( − W , W ) 内且为常数。 定义
g ( t ) = ∑ n = − ∞ ∞ f ( n 2 W ) sinc ( t − n 2 W ) g(t) = \sum_{n=-\infty}^\infty f\left( \frac{n}{2W}\right) \text{sinc}\left( t-\frac{n}{2W} \right) g ( t ) = n = − ∞ ∑ ∞ f ( 2 W n ) sinc ( t − 2 W n )
根据sinc函数特性,g ( t ) g(t) g ( t ) 谱限制在W内,且g ( n 2 W ) = f ( n 2 W ) g\left( \frac{n}{2W} \right)=f(\frac{n}{2W}) g ( 2 W n ) = f ( 2 W n ) ,因此g ( t ) = f ( t ) g(t)=f(t) g ( t ) = f ( t ) 。这是根据采样值显式表达f ( t ) f(t) f ( t ) 的方式。
如果函数局限在一个频带内,那么它在时域内就是非局限的。单我们可以考虑那些绝大多数能量在带宽W内、同时绝大多数能量在有限时间段内,例如( 0 , T ) (0,T) ( 0 , T ) 。我们可以通过一个叫做prolate spheroidal functions。简而言之,大约有2 T W 2TW 2 T W 个正交基满足:几乎局限在某一时域和某一频域内。我们可以用这组基底表述任意满足该条件的函数。 Noise:Independent and Gaussian,每个样点都是独立同分布的高斯型随机变量。若噪声有功率谱密度N 0 2 \frac{N_{0}}{2} 2 N 0 ,带宽W W W ,那么噪声功率为N 0 2 × 2 W = N 0 W \frac{N_{0}}{2}\times 2W=N_{0}W 2 N 0 × 2 W = N 0 W ,T T T 时间内所有2 W T 2WT 2 W T 个噪声样点具有方差N 0 W T 2 W T = N 0 2 \frac{N_{0}WT}{2WT}=\frac{N_{0}}{2} 2 W T N 0 W T = 2 N 0 。将输入看成是2 T W 2TW 2 T W 维向量空间的元素,我们看到接收信号球状正态分布在该点附近,协方差N 0 2 I \frac{N_{0}}{2}I 2 N 0 I 于是可以利用离散时间高斯信道的结论C = 1 2 log ( 1 + P N ) C=\frac{1}{2}\log (1+\frac{P}{N}) C = 2 1 log ( 1 + N P ) 。在[ 0 , T ] [0,T] [ 0 , T ] 时间段内使用信道,此时每个样点功率P T 2 W T = P 2 W \frac{PT}{2WT}=\frac{P}{2W} 2 W T PT = 2 W P ,每个样点噪声方差N 0 2 2 W T 2 W T = N 0 2 \frac{N_{0}}{2}2W \frac{T}{2WT}=\frac{N_{0}}{2} 2 N 0 2 W 2 W T T = 2 N 0 ,于是每个样本点的信息容量
C = 1 2 log ( 1 + P 2 W N 0 2 ) = 1 2 log ( 1 + P N 0 W ) bits/sample C=\frac{1}{2}\log \left( 1+ \frac {\frac{P}{2W}}{\frac{N_{0}}{2}} \right) = \frac{1}{2}\log \left( 1+\frac{P}{N_{0}W} \right)\ \ \text{bits/sample} C = 2 1 log ( 1 + 2 N 0 2 W P ) = 2 1 log ( 1 + N 0 W P ) bits/sample
由于每秒有2 W 2W 2 W 个样本点,信道信息传输速率
C = W log ( 1 + P N 0 W ) bit/s C = W \log \left( 1+ \frac{P}{N_{0}W} \right) \ \ \ \text{bit/s} C = W log ( 1 + N 0 W P ) bit/s
It gives the capacity of a band-limited Gaussian channel with noise spectral density N 0 2 watts/Hz \frac{N_{0}}{2} \text{watts/Hz} 2 N 0 watts/Hz and power P watts P\ \text{watts} P watts
Let W → ∞ W\to \infty W → ∞ , C = P N 0 log 2 e bit/s C=\frac{P}{N_{0}} \log_{2}e \quad\text{bit/s} C = N 0 P log 2 e bit/s
10.4 Parallel Gaussian Channels
假设有多个高斯信道,每个信道噪声N i N_{i} N i ,要求功率和限制为P P P 。设给每个信道分配功率P i P_{i} P i ,于是问题成为优化问题:最大化∑ 1 2 log ( 1 + P i N i ) \sum \frac{1}{2}\log (1+\frac{P_{i}}{N_{i}}) ∑ 2 1 log ( 1 + N i P i ) ,同时拥有限制∑ i P i = P \sum_{i}P_{i}=P ∑ i P i = P
J ( P i ) = ∑ i 1 2 log ( 1 + P i N i ) + λ ( ∑ P i ) J(P_{i}) = \sum_{i} \frac{1}{2} \log \left( 1+ \frac{P_{i}}{N_{i}} \right) + \lambda \left( \sum P_{i} \right) J ( P i ) = i ∑ 2 1 log ( 1 + N i P i ) + λ ( ∑ P i )
1 2 1 P i + N i + λ = 0 , P i = ν − N i for constant ν \frac{1}{2} \frac{1}{P_{i}+N_{i}} + \lambda = 0,\qquad P_{i} = \nu-N_{i}\ \text{ for constant }\nu 2 1 P i + N i 1 + λ = 0 , P i = ν − N i for constant ν
由于功率非负,可能找不到这种形式的解。最终,我们可以使用watter-filling的方式找出功率分配的最佳方式,如图所示。
10.5 Channels with Colored Gaussian Noise
现在考虑噪声之间具有依赖的情形。这不仅表示并行信道的情形,也表示一个具有记忆的高斯噪声的信道。对于有记忆的信道,我们考虑一组连续使用n次信道。这也可看作n个信道平行使用,而噪声是相互关联的。
令K Z K_{Z} K Z 是噪声的协方差矩阵,K X K_{X} K X 是输入协方差矩阵。输入功率限制为
1 n ∑ i E X i 2 ≤ P \frac{1}{n}\sum_{i}EX_{i}^2 \leq P n 1 i ∑ E X i 2 ≤ P
或等价地:
1 n t r ( K X ) ≤ P \frac{1}{n} tr(K_{X})\leq P n 1 t r ( K X ) ≤ P
I ( X 1 , … , X n ; Y 1 , … , Y n ) = h ( Y 1 , … , Y n ) − h ( Z 1 , … , Z n ) I(X_{1},\dots,X_{n};Y_{1},\dots,Y_{n}) = h(Y_{1},\dots,Y_{n})-h(Z_{1},\dots,Z_{n}) I ( X 1 , … , X n ; Y 1 , … , Y n ) = h ( Y 1 , … , Y n ) − h ( Z 1 , … , Z n )
当输入X是正态分布时输出Y也是正态分布,这时Y的熵最大。由于输入和噪声独立,Y Y Y 的协方差矩阵K Y = K X + K Z K_{Y}=K_{X}+K_{Z} K Y = K X + K Z ,熵为
h ( Y 1 , … , Y n ) = 1 2 log ( ( 2 π e ) n ∣ K X + K Z ∣ ) h(Y_{1},\dots,Y_{n}) = \frac{1}{2} \log ((2\pi e)^n |K_{X}+K_{Z}|) h ( Y 1 , … , Y n ) = 2 1 log (( 2 π e ) n ∣ K X + K Z ∣ )
于是问题化为选择K X K_{X} K X ,使h ( Y 1 , … , Y n ) h(Y_{1},\dots,Y_{n}) h ( Y 1 , … , Y n ) 最大,也就是∣ K X + K Z ∣ | K_{X}+K_{Z}| ∣ K X + K Z ∣ 最大。对K Z K_{Z} K Z 正交对角化:K Z = Q Λ Q t K_{Z}=Q\Lambda Q^t K Z = Q Λ Q t
∣ K X + K Z ∣ = ∣ K X + Q Λ Q t ∣ = ∣ Λ + Q t K X Q ∣ ≡ ∣ Λ + A ∣ | K_{X}+K_{Z}| = |K_{X}+Q\Lambda Q^t| = |\Lambda+Q^tK_{X}Q| \equiv|\Lambda+A| ∣ K X + K Z ∣ = ∣ K X + Q Λ Q t ∣ = ∣Λ + Q t K X Q ∣ ≡ ∣Λ + A ∣
容易证明t r ( A ) = t r ( K X ) = n P tr(A)=tr(K_{X})=nP t r ( A ) = t r ( K X ) = n P 。 使用第九章中的Hadamard's Inequality:任意正定矩阵K的行列式小于等于它的对角线元素之积:
∣ K ∣ ≤ ∏ i K i i | K| \leq \prod_{i}K_{ii} ∣ K ∣ ≤ i ∏ K ii
去等当且仅当K对角。于是∣ A + Λ ∣ ≤ ∏ i ( A i i + λ i ) |A+\Lambda|\leq \prod_{i}(A_{ii}+\lambda_{i}) ∣ A + Λ∣ ≤ ∏ i ( A ii + λ i ) ,取等当且仅当A A A 对角。由于A有trace限制,连乘积在A i i + λ i = ν A_{ii}+\lambda_{i}=\nu A ii + λ i = ν 时取得最大值。 同样,由于限制,并不是总能同时满足A i i > 0 A_{ii}> 0 A ii > 0 和该条件。此时,通过标准Kuhn-Tucker条件,最优的选择对应于选择
A i i = ( ν − λ i ) + , choose ν s.t. ∑ i A i i = n P A_{ii}= (\nu-\lambda_{i})^+, \qquad \text{choose }\nu\ \ \text{s.t.}\sum_{i}A_{ii}=nP A ii = ( ν − λ i ) + , choose ν s.t. i ∑ A ii = n P
Consider a channel in which the additive Gaussian noise forms stochastic process with finite dimensional covariance matrix K Z ( n ) K_{Z}^{(n)} K Z ( n ) . If the process is stationary,then the covariance matrix is Toeplitz(每条左上-右下的线上所有元素相等) and the eigenvalues tend to limit as n → ∞ n\to \infty n → ∞ . The density of eigenvalues on the real line tends to the power spectrum of the stochastic process. In this case, the above 'water-filling' argument translates to watter-filling in the spectral domain.
Hense for channels in which the noise forms a stationary stochastic process, the input signal should be chosen to be a Gaussian process with a spectrum which is large at frequencies where the noise spectrum is small. The capacity of an additive Gaussian noise channel with noise power spectrum N(f) can be shown to be
C = ∫ 1 2 log ( 1 + ( ν + N ( f ) ) + N ( f ) ) d f C=\int \frac{1}{2}\log \left( 1+ \frac{(\nu+N(f))^+}{N(f)} \right) \, df C = ∫ 2 1 log ( 1 + N ( f ) ( ν + N ( f ) ) + ) df
Leave a Comment